-
Notifications
You must be signed in to change notification settings - Fork 25.5k
[INFERENCE] Remove node pinning logic #134630
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Pinging @elastic/ml-core (Team:ML) |
* @param divisor - divisor to divide the initial requests per time unit by | ||
*/ | ||
public synchronized void updateTokensPerTimeUnit(Integer divisor) { | ||
private synchronized void updateTokensPerTimeUnit(Integer divisor) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@jonathan-buttner is this code and the related updateRateLimitDivisor()
functions still required?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No I'm pretty sure we only added for the node pinning so we should be able to remove it
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍 removed in 93f0f19
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we also remove the routed logic here: https://github.com/elastic/elasticsearch/blob/main/x-pack/plugin/core/src/main/java/org/elasticsearch/xpack/core/inference/action/BaseInferenceActionRequest.java#L26
Good catch thanks. Removed in ab7e442 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks Dave!
Developed for EIS and no longer required
Request routing logic was added to enable per node rate limiting with the Elastic Inference Service but that code remained behind a feature flag and was never enabled. EIS now implements rate limiting make the code redundant.